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(1) GENERAL INFORMATION: 

(i) APPLICANT: Yu, Guo-Liang 
. Rosen, Craig 

(ii) TITLE OF INVENTION: Colon Specific Genes and Proteins 

(iii) NUMBER OF SEQUENCES : 24 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Carella, Byrne, Bain, Gilfillan, Cecch 

Stewart & Olstein 
<B) STREET: 6 Becker Farm Road 

(C) CITY: Roseland 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP : 07068-1739 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/469,667 

(B) FILING DATE: 06-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Ferraro, Gregory D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE/DOCKET NUMBER: 325800-435 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : 201-994-1700 

(B) TELEFAX : 2 01-994-1744 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 8 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1 . .501 



1 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .501 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GCC AGG CAG CTG GCT GCC SAC CAG GCC GTG TAT GTG AAG GTC AAG GCT 4 8 

Ala Arg Gin Leu Ala Ala Xaa Gin Ala Val Tyr Val Lys Val Lys Ala 
15 10 15 

GAA GCC CGG GAA CTG CTG GGC CAC CCG TGG TCT CTG TGT CCT GTC TGT 96 
Glu Ala Arg Glu Leu Leu Gly His Pro Trp Ser Leu Cys Pro Val Cys 
20 25 30 

GGG TGC CAA CTC ACC ACC TTT GAT GGG GCC CGT GGT GCC ACC ACT CTC 14 4 

Gly Cys Gin Leu Thr Thr Phe Asp Gly Ala Arg Gly Ala Thr Thr Leu 
35 40 45 

CTG GTG TCT ATG AAG CTC TCT TCC CGC TGC CCA GGA CTA CAG AAT ACC 192 
Leu Val Ser Met Lys Leu Ser Ser Arg Cys Pro Gly Leu Gin Asn Thr 
50 55 60 

ATC CCC TGG TAC CGT GTA GTT GCC GAA GTC CAG ATC TGC CAT GGC AAA 24 0 

lie Pro Trp Tyr Arg Val Val Ala Glu Val Gin lie Cys His Gly Lys 
65 70 75 80 

ACG GAG GCT GTG GGC CAG GTC CAC ATC TTC TTC CAG GAT GGG ATG GTG 2 88 

Thr Glu Ala Val Gly Gin Val His lie Phe Phe Gin Asp Gly Met Val 
85 90 95 

ACG TTG ACT CCA AAC AAG GGT GTG TGG GTG AAT GGT CTC CGA GTG GAT 336 
Thr Leu Thr Pro Asn Lys Gly Val Trp Val Asn Gly Leu Arg Val Asp 
100 105 110 

CTC CCA GCT GAG AAG TTA GCA TCT GTG TCC GTG AGT CGT ACA CCT GAT 3 84 

Leu Pro Ala Glu Lys Leu Ala Ser Val Ser Val Ser Arg Thr Pro Asp 
115 120 125 

GGC TCC CTG CTA GTC CGC CAG AAG GCA GGG GTC CAG GTG TGG CTT GGA 432 
Gly Ser Leu Leu Val Arg Gin Lys Ala Gly Val Gin Val Trp Leu Gly 
130 135 140 

GCC AAT GGG AAG GTG GCT GTG ATT GTG AGC AAT GAC CAT GCT GGG AAA 4 80 

Ala Asn Gly Lys Val Ala Val lie Val Ser Asn Asp His Ala Gly Lys 
145 150 155 160 

CTG TGT GGG GGC CTK TGG AAA ATTTGACGGG GGACCAGACC AATGATTGGG 531 
Leu Cys Gly Gly Xaa Trp Lys 
165 

ATGATTCCCA GGAGAAGCCA GCGATTGGGG AAWTGGAGAG CGCAGGGACT TTCTYCCMCA 591 
TGTTAATGGG C TTG WT C CAG TTCATCCCAC CAGGAACGAA GGATTTT 63 8 



(2) INFORMATION FOR SEQ ID NO : 2 : 



2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ala Arg Gin Leu Ala Ala Xaa Gin Ala Val Tyr Val Lys Val Lys Ala 
15 10 15 

Glu Ala Arg Glu Leu Leu Gly His Pro Trp Ser Leu Cys Pro Val Cys 
20 25 30 

Gly Cys Gin Leu Thr Thr Phe Asp Gly Ala Arg Gly Ala Thr Thr Leu 
35 40 45 

Leu Val Ser Met Lys Leu Ser Ser Arg Cys Pro Gly Leu Gin Asn Thr 
50 55 60 

lie Pro Trp Tyr Arg Val Val Ala Glu Val Gin lie Cys His Gly Lys 
65 70 75 80 

Thr Glu Ala Val Gly Gin Val His lie Phe Phe Gin Asp Gly Met Val 
85 90 95 

Thr Leu Thr Pro Asn Lys Gly Val Trp Val Asn Gly Leu Arg Val Asp 
100 105 110 

Leu Pro Ala Glu Lys Leu Ala Ser Val Ser Val Ser Arg Thr Pro Asp 
115 120 125 

Gly Ser Leu Leu Val Arg Gin Lys Ala Gly Val Gin Val Trp Leu Gly 
130 135 140 

Ala Asn Gly Lys Val Ala Val lie Val Ser Asn Asp His Ala Gly Lys 
145 150 155 , 160 

Leu Cys Gly Gly Xaa Trp Lys 
165 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 874 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOG Y : 1 i ne ar 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 705 
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(ix) FEATURE : 

(A) NAME /KEY : mat_pept ide 

(B) LOCATION: 1. .705 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CAG GAC TGC GTG TGC ACG GAC AAG GTG GAC AAC AAC ACC CTG CTC AAC 4 8 

Gin Asp Cys Val Cys Thr Asp Lys Val Asp Asn Asn Thr Leu Leu Asn 
1 5 10 15 



GTC ATC GCC TGC ACC CAC GTG CCC TGC AAC ACC TCC TGC AGC CCT GGG 96 
Val He Ala Cys Thr His Val Pro Cys Asn Thr Ser Cys Ser Pro Gly 
20 25 30 

TTC GAA CTC ATG GAG GCC CCC GGG GAG TGC TGT AAG AAG TGT GAA CAG 14 4 

Phe Glu Leu Met Glu Ala Pro Gly Glu Cys Cys Lys Lys Cys Glu Gin 

35 40 45 

ACG CAC TGT ATC ATC AAA CGG CCC GAC AAC CAG CAC GTC ATC CTG AAG 192 

Thr His Cys He He Lys Arg Pro Asp Asn Gin His Val He Leu Lys 

50 55 60 

CCC GGG GAC TTC AAG AGC GAC CCG AAG AAC AAC TGC ACA TTC TTC AGC 24 0 

Pro Gly Asp Phe Lys Ser Asp Pro Lys Asn Asn Cys Thr Phe Phe Ser 
65 70 75 80 

TGC GTG AAG ATC CAC AAC CAG CTC ATC TCG TCC GTT TCC AAC ATC ACC 288 
Cys Val Lys He His Asn Gin Leu He Ser Ser Val Ser Asn He Thr 
85 90 95 

TGC CCC AAC TTT GAT GCC AGC ATT TGC ATC CCG GGC TCC ATC ACA TTC 336 
Cys Pro Asn Phe Asp Ala Ser He Cys He Pro Gly Ser He Thr Phe 
100 105 HO 

ATG CCC AAT GGA TGC TGC AAG ACC TGC ACC CCT CGC AAT GAG ACC AGG 3 84 

Met Pro Asn Gly Cys Cys Lys Thr Cys Thr Pro Arg Asn Glu Thr Arg 
115 120 125 

GTG CCC TGC TCC ACC GTC CCC GTC ACC ACG GAG GTT TCG TAC GCC GGC 43 2 

Val Pro Cys Ser Thr Val Pro Val Thr Thr Glu Val Ser Tyr Ala Gly 
130 135 140 

TGC ACC AAG ACC GTC CTC ATG AAT CAT TGC TCC GGG TCC TGC GGG ACA 4 80 

Cvs Thr Lys Thr Val Leu Met Asn His Cys Ser Gly Ser Cys Gly Thr 
145 150 155 160 

TTT GTC ATG TAC TCG GCC AAG GCC CAG GCC CTG GAC CAC AGC TGC TCC 52 8 

Phe Val Met Tyr Ser Ala Lys Ala Gin Ala Leu Asp His Ser Cys Ser 
165 170 175 

TGC TGC AAA GAG GAG AAA ACC AGC CAG CGT GAG GTG GTC CTG AGC TGC 57 6 

Cys Cys Lys Glu Glu Lys Thr Ser Gin Arg Glu Val Val Leu Ser Cys 
180 185 190 

CCC AAT GGC GGC TCG CTG ACA CAC ACC TAC ACC CAC ATC GAG AGC TGC 624 
Pro Asn Gly Gly Ser Leu Thr His Thr Tyr Thr His He Glu Ser Cys 
195 200 205 
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CAG TGC CAG GAC ACC GTC TGC GGG CTC CCC ACC GGC ACC TCC CGC CGG 67 2 

Gin Cys Gin Asp Thr Val Cys Gly Leu Pro Thr Gly Thr Ser .Arg Arg 

210 215 220 

GCC CGG CGT TCC CCT AGG CAT CTG GGG AGC GGG TG AG CGGGGT GGGCACAGCC 72 5 
Ala Arg Arg Ser Pro Arg His Leu Gly Ser Gly 
225 230 235 

CCTTCACTGC CCTCGACAGC TTTACCTCCC CCGGACCCTC TGAGCCTCCT AAGCTCGGCT 78 5 

TCCTCTCTTC AGATATTTAT TGTCTGAGTT TTTGTTCAGT CCTTGCTTTC CAATAATAAA 84 5 

CTCAGGGGGA CATGCAAAAA AAAAAAAAA 8 74 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Gin Asp Cys Val Cys Thr Asp Lys Val Asp Asn Asn Thr Leu Leu Asn 
15 10 15 

Val He Ala Cys Thr His Val Pro Cys Asn Thr Ser Cys Ser Pro Gly 
20 25 30 

Phe Glu Leu Met Glu Ala Pro Gly Glu Cys Cys Lys Lys Cys Glu Gin 
35 40 45 

Thr His Cys He He Lys Arg Pro Asp Asn Gin His Val He Leu Lys 
50 55 60 

Pro Gly Asp Phe Lys Ser Asp Pro Lys Asn Asn Cys Thr Phe Phe Ser 
65 70 75 80 

Cys Val Lys He His Asn Gin Leu He Ser Ser Val Ser Asn He Thr 
85 90 95 

Cys Pro Asn Phe Asp Ala Ser He Cys He Pro Gly Ser He Thr Phe 
100 105 HO 

Met Pro Asn Gly Cys Cys Lys Thr Cys Thr Pro Arg Asn Glu Thr Arg 
115 120 125 

Val Pro Cys Ser Thr Val Pro Val Thr Thr Glu Val Ser Tyr Ala Gly 
130 135 140 

Cys Thr Lys Thr Val Leu Met Asn His Cys Ser Gly Ser Cys Gly Thr 
145 150 155 160 

Phe Val Met Tyr Ser Ala Lys Ala Gin Ala Leu Asp His Ser Cys Ser 
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165 



170 



175 



Cys 



Cys Lys Glu Glu Lys Thr Ser Gin Arg Glu Val Val Leu Ser Cys 
180 185 190 



Pro 



Asn Gly Gly Ser Leu Thr His Thr Tyr Thr His lie Glu Ser Cys 
195 200 205 



Gin 



Cys Gin Asp Thr Val Cys Gly Leu Pro Thr Gly Thr Ser Arg Arg 
210 215 220 



Ala 
225 



Arg Arg Ser Pro Arg His Leu Gly Ser Gly 
230 235 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATTGGTGCTA CCTGGCTCTC CTGTCTCTGC AGCTCTACAG GTGAGGCCCA GCAGAGGGAG 60 

TAGGGCTCGC CATGTTTCTG GTGAGCCAAT TTGGCTGATC TTGGGTGTCT GAACAGCTAT 12 0 

TGGGTCCACC CCAGTCCCTT TCAGCTGCTG CTTAATGCCC TGCTCTCTCC CTGGCCCACC 180 

TTATAGAGAG CCCAAAGAGC TCCTGTAAGA GGGAGAACTC TATCTGTGGT TTATAATCTT 24 0 

GCACGAGGCA CCAGAAGTCT CCCTGGGTCT TGTGAATGAA CTACATTTAT CCCCTTTCCT 3 00 

GCCCCAACCA CAAACTCTTT CCTTCAAAGA GGGCCTGCCT GGTTCCCTCC ACCCAACTGC 3 60 

ACCATGAGAT CGGTCCAAGA GTCCATTCCC CAGGTGGGAG CCAACTGTCA GGGAGGTCTT 42 0 

TCCCACCAAA CATCTTTCAG TTGCTGGGAG GTG AC CAT AG GGCTCTGCTT TTAAAGATAT 480 

GGCTGCTTCA AAGGCCAGAG TCACAGGAAG GACTTCTTCC AGGGAGATTA GTGGTGATGG 54 0 

AGAGGAGAGT TAAAATGACC TCATGTCCTT CTTGTC CACG GTTTTGTTGA GTTTTCACTC 60 0 

TTCTAATGCA AGGGTCTCAC ACTGTGAACC ACTTAGGATG TGATCACTTT CAGGTGGCCA 660 

GGAATGTTGA ATGTCTTTGG CTCAGTTCAT CTAAAAAAGA TATCTATTTG AAAGTTCTCA 72 0 

GAGTTGTACA TATGTTTCAC AGTACAGGAT CTGTACATAA AAGTTTCTTT C CT AAAC CAT 78 0 

TCACCAAGAG CCAATATCTA GGCATTTCCT CGGTAGCACA AATTTTCTNA TTGCTTAGAA 84 0 

AATTGTCCTC CCTGTTCTTT CTGTCTGNAG ACTTAAGTGA GTTAGGTCTT T AAGGAAAG C 90 0 
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AACGCTCCTC TGAAATGCTT GTCTTTTTTC TGTTGCCGAA ATAGCTGGTC CTTTTTCGGG 960 

AGTTAGATGT ATAGAGTGTT TGTATGTAAA CATTTCTTGT AGGCATCACC ATGAACANAG 1020 

AT AT ATTTT C TATTTANTTA NTATATGTGC ACTTCAAGAA GTCACTGTCA GAGAAATAAA 10 8 0 

GAATTGTCTT AAATGTCATG ATTGGAGATG TCCTTTGCAT TGCTTGGAAG GGGTGTACCT 114 0 

AGAGCCAAGG AAATTGGCTC TGGTTTGGAA AAATTTTGCT GTTATTATAG TAAACATACA 12 0 0 

AAGGATGTC 12 0 9 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .405 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1 . .405 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



ATG AGT CCT 
Met Ser Pro 
1 

GTG AAT GGA 
Val Asn Gly 



TAC TCT GGA 
Tyr Ser Gly 
35 

GGG GGT GCA 
Gly Gly Ala 
50 

ATG AAC GAT 
Met Asn Asp 
65 

GCA AAA CAA 
Ala Lys Gin 



GTG AAA AAC 
Val Lys Asn 
5 

ACC ACG GGA 
Thr Thr Gly 
20 

GAT GTT ATG 
Asp Val Met 



CTG GTG CTG 
Leu Val Leu 



GAA AGC AGG 
Glu Ser Arg 
70 

CTG GGC TTC 
Leu Gly Phe 



AAT GTG GGC 
Asn Val Gly 



GCT GTG CTG 
Ala Val Leu 
25 

CAC CTA GTG 
His Leu Val 
40 

GTG GCC TCC 
Val Ala Ser 
55 

AAA CTC TTC 
Lys Leu Phe 



CGG GAC AGC 
Arg Asp Ser 



AGA GGC CTA 
Arg Gly Leu 
10 

GGA CAG AAG 
Gly Gin Lys 



AAA TTC CTT 
Lys Phe Leu 



TAC GAC GAT 
Tyr Asp Asp 
60 

TCT GAC TTG 
Ser Asp Leu 
75 

TGG GTC TTC 
Trp Val Phe 



AAC ATC GCC 
Asn lie Ala 
15 

GCA TTT GAC 
Ala Phe Asp 
30 

AAA GAA ATT 
Lys Glu lie 
45 

CCA GGG ACC 
Pro Gly Thr 



GGG AGT TCC 
Gly Ser Ser 



ATA GGA GCC 
He Gly Ala 



CTG 4 8 

Leu 



ATG 96 
Met 



CCG 144 
Pro 



AAA 192 
Lys 

TAC 240 
Tyr 
80 

AAA 288 
Lys 
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85 90 95 

GAC CTC AGG GGT AAA AGC CCC TTT GAG CAG TTC TTA AAG AAC AGC CCA 3 36 

Asp Leu Arg Gly Lys Ser Pro Phe Glu Gin Phe Leu Lys Asn Ser Pro 
100 105 110 

GAC ACA AAC AAA TAC GAG GGA TGG CCA GAG CTG CTG GAG ATG GAG GGC 3 84 

Asp Thr Asn Lys Tyr Glu Gly Trp Pro Glu Leu Leu Glu Met Glu Gly 
115 120 125 

TGC ATG CCC CCG AAG CCA TTT TAGGGTGGCT GTGGCTCTTC CTCAGCCAGG 43 5 

Cys Met Pro Pro Lys Pro Phe 
130 135 

GGCCTGAAGA AGYTCCTGCC TGCATTAGGA GTCANAGCCC GGCAGGCTGN AGGAGGAGGA 4 95 

GCAGGGGGTG CTGCGTGGAA GGTGCTGCAG GCCTTGCACG CTGTGTCGCG CCT 54 8 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 5 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Ser Pro Val Lys Asn Asn Val Gly Arg Gly Leu Asn lie Ala Leu 
15 10 15 

Val Asn Gly Thr Thr Gly Ala Val Leu Gly Gin Lys Ala Phe Asp Met 
20 25 30 

Tyr Ser Gly Asp Val Met His Leu Val Lys Phe Leu Lys Glu He Pro 
35 40 45 

Gly Gly Ala Leu Val Leu Val Ala Ser Tyr Asp Asp Pro Gly Thr Lys 
50 55 60 

Met Asn Asp Glu Ser Arg Lys Leu Phe Ser Asp Leu Gly Ser Ser Tyr 
65 70 75 80 

Ala Lys Gin Leu Gly Phe Arg Asp Ser Trp Val Phe He Gly Ala Lys 
85 90 95 

Asp Leu Arg Gly Lys Ser Pro Phe Glu Gin Phe Leu Lys Asn Ser Pro 
100 105 HO 

Asp Thr Asn Lys Tyr Glu Gly Trp Pro Glu Leu Leu Glu Met Glu Gly 
115 120 125 

Cys Met Pro Pro Lys Pro Phe 
130 135 
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(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 878 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2.. 685 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

T GTC TAC TCA AGG TAT TTC ACA ACT TAT GAC ACG AAT GGT* AGA TAC 4 6 

Val Tyr Ser Arg Tyr Phe Thr Thr Tyr Asp Thr Asn Gly Arg Tyr 
15 10 15 

AGT GTA AAA GTG CGG GCT CTG GGA GGA GTT AAC GCA GCC AGA CGG AGA 94 
Ser Val Lys Val Arg Ala Leu Gly Gly Val Asn Ala Ala Arg Arg Arg 
20 25 30 

GTG ATA CCC CAG CAG AGT GGA GCA CTG TAC ATA CCT GGC TGG ATT GAG 14 2 

Val lie Pro Gin Gin Ser Gly Ala Leu Tyr lie Pro Gly Trp lie Glu 
35 40 45 

AAT GAT GAA ATA CAA TGG AAT CCA CCA AGA CCT GAA ATT AAT AAG GAT 190 
Asn Asp Glu lie Gin Trp Asn Pro Pro Arg Pro Glu lie Asn Lys Asp 
50 55 60 

GAT GTT CAA CAC AAG CAA GTG TGT TTC AGC AGA ACA TCC TCG GGA GGC 23 8 

Asp Val Gin His Lys Gin Val Cys Phe Ser Arg Thr Ser Ser Gly Gly 
65 70 75 

TCA TTT GTG GCT TCT GAT GTC CCA AAT GCT CCC ATA CCT GAT CTC TTC 2 86 

Ser Phe Val Ala Ser Asp Val Pro Asn Ala Pro He Pro Asp Leu Phe 
80 85 90 95 

CCA CCT GGC CAA ATC ACC GAC CTG AAG GCG GAA ATT CAC GGG GGC AGT 3 34 

Pro Pro Gly Gin He Thr Asp Leu Lys Ala Glu He His Gly Gly Ser 
100 105 110 

CTC ATT AAT CTG ACT TGG ACA GCT CCT GGG GAT GAT TAT GAC CAT GGA 3 82 

Leu He Asn Leu Thr Trp Thr Ala Pro Gly Asp Asp Tyr Asp His Gly 
115 120 125 

ACA GCT CAC AAG TAT ATC ATT CGA ATA AGT ACA AGT ATT CTT GAT CTC 43 0 

Thr Ala His Lys Tyr He He Arg He Ser Thr Ser He Leu Asp Leu 
130 135 140 

AGA GAC AAG TTC AAT GAA TCT CTT CAA GTG AAT ACT ACT GCT CTC ATC 478 
Arg Asp Lys Phe Asn Glu Ser Leu Gin Val Asn Thr Thr Ala Leu He 
145 150 155 
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CCA AAG GAA GCC AAC TCT GAG GAA GTC TTT TTG TTT AAA CCA GAA AAC 5 26 

Pro Lys Glu Ala Asn Ser Glu Glu Val Phe Leu Phe Lys Pro Glu Asn 
160 165 170 175 

ATT ACT TTT GAA AAT GGC ACA GAT CTT TTC ATT GCT ATT CAG GCT GTT 5 74 

lie Thr Phe Glu Asn Gly Thr Asp Leu Phe lie Ala lie Gin Ala Val 
180 185 190 

GAT AAG GTC GAT CTG AAA TCA GAA ATA TCC AAC ATT GCA CGA GTA TCT 622 
Asp Lys Val Asp Leu Lys Ser Glu lie Ser Asn He Ala Arg Val Ser 
195 200 205 

TTG TTT ATT CCT CCA CAG ACT CCG CCA GAG ACA CCT AGT CCT GAT GAA 67 0 

Leu Phe He Pro Pro Gin Thr Pro Pro Glu Thr Pro Ser Pro Asp Glu 
210 215 220 

ACG TCT GCT CCT TGT GCCTAATATT CATATCAACA GCACCATTCC TGGCATTCAC 725 
Thr Ser Ala Pro Cys 
225 

ATTTTAAAAA TTATGTGGAA GTGGGTAGGA GAACTGCAGT TGTCAATAGN CTAGGGGTGA 7 85 

ATTTTTGTGC GGTGAATAAA TAATSATTTC ANCCTTTTTT TGRTTTATAA AAAAACGGNT 84 5 

NCCCATTGGG NNTNTNGNGG GGGGGNNTTT TAA 878 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Val Tyr Ser Arg Tyr Phe Thr Thr Tyr Asp Thr Asn Gly Arg Tyr Ser 
15 10 15 

Val Lys Val Arg Ala Leu Gly Gly Val Asn Ala Ala Arg Arg Arg Val 
20 25 30 

He Pro Gin Gin Ser Gly Ala Leu Tyr He Pro Gly Trp He Glu Asn 
35 40 45 

Asp Glu He Gin Trp Asn Pro Pro Arg Pro Glu He Asn Lys Asp Asp 
50 55 60 

Val Gin His Lys Gin Val Cys Phe Ser Arg Thr Ser Ser Gly Gly Ser 
65 70 75 80 

Phe Val Ala Ser Asp Val Pro Asn Ala Pro He Pro Asp Leu Phe Pro 
85 90 95 
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Pro Gly Gin lie Thr Asp Leu Lys Ala Glu lie His Gly Gly Ser Leu 
100 105 HO 

He Asn Leu Thr Trp Thr Ala Pro Gly Asp Asp Tyr Asp His Gly Thr 
115 120 125 

Ala His Lys Tyr He He Arg lie Ser Thr Ser He Leu Asp Leu Arg 
130 135 140 

Asp Lys Phe Asn Glu Ser Leu Gin Val Asn Thr Thr Ala Leu He Pro 
145 150 155 160 

Lys Glu Ala Asn Ser Glu Glu Val Phe Leu Phe Lys Pro Glu Asn He 
165 170 175 

Thr Phe Glu Asn Gly Thr Asp Leu Phe He Ala He Gin Ala Val Asp 
180 185 190 

Lys Val Asp Leu Lys Ser Glu He Ser Asn He Ala Arg Val Ser Leu 
195 200 205 

Phe He Pro Pro Gin Thr Pro Pro Glu Thr Pro Ser Pro Asp Glu Thr 
210 215 220 

Ser Ala Pro Cys 
225 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2 . .490 

(ix) FEATURE: 

(A) NAME/ KEY : mat_pept ide 

(B) LOCATION: 2.. 490 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

A GTC GCT CTC CTA GCC CTT CTC TGT GCC TCA CCC TCT GGC AAT GCC 4 6 

Val Ala Leu Leu Ala Leu Leu Cys Ala Ser Pro Ser Gly Asn Ala 
1 5 10 15 

ATT CAG GCC AGG TCT TCC TCC TAT AGT GGA GAG TAT GGA GGT GGT GGT 94 
He Gin Ala Arg Ser Ser Ser Tyr Ser Gly Glu Tyr Gly Gly Gly Gly 
20 25 30 
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a. art 




TTC 


TCT 


CAT 


TCT 


GGC 


AAC 


CAG 


TTG 


GAC 


GGC 


CCC 


ATC 


ACC 


142 


Gly 


Lys 


Arg 


Phe 


Ser 


His 


Ser 


Gly 


Asn 


Gin 


Leu 


Asp 


Gly 


Pro 


He 


Thr 






35 










40 










4 5 








GCC 


CTC 


CGG 


GTC 


CGA 


GTC 


AAC 


ACA 


TAC 


TAC 


ATC 


GTA 


GGT 


CTT 




bib 


iyu 




Lsu 


Arg 


Val 


Arg 


Val 


Asn 


Thr 


Tyr 


Tyr 


He 


Val 


Gly 


Leu 


Gin 


Val 








50 








55 










60 










CGC 


TAT 


GGC 


AAG 


GTG 


TGG 


AGC 


GAC 


TAT 


GTG 


GGT 


GGT 


CGC 


AAC 


GGA 


GAC 


23 8 


Arg 


Tyr 






Val 


Tm 


Ser 


Asp 


Tyr 


Val 


Gly 


Gly 


Arg 


Asn 


Gly 


Asp 




65 










70 










75 












CTG 


GAG 


GAG 


ATC 


TTT 


CTG 


CAC 


CCT 


GGG 


GAA 


TCA 


GTG 


ATC 


CAG 


GTT 


TCT 


2 8 6 


Leu 


Glu 


CjIU 


lie 




Leu 


His 


P ro 


Glv 


Glu 


Ser 


Val 


He 


Gin 


Val 


Ser 




80 










85 










90 










95 




GGG 


AAG 


TAC 


AAG 


TGG 


TAC 


CTG 


AAG 


AAG 


CTG 


GTA 


TTT 


GTG 


ACA 


GAC 


AAG 


334 


Giy 


Lys 


Tyr 


Lys 


Trp 




Leu 


Lys 


Lys 


Leu 


Val 


Phe 


Val 


Thr 


Asp 


Lys 








100 










105 










110 






GGC 


CGC 


TAT 


CTG 


TCT 


TTT 


GGG 


AAA 


GAC 


AGT 


GGC 


ACA 


AGT 


TTC 


AAT 


GCC 


3 8 2 


(jiy 


Arg 


Tyr 


Leu 


Car 


Phe 


Glv 


Lys 


Asd 


Ser 


Gly 


Thr 


Ser 


Phe 


Asn 


Ala 




115 










120 










125 








GTC 


ccc 


TTG 


CAC 


CCC 


AAC 


ACC 


GTG 


CTC 




rnrpp 




APT 




CGG 


TCT 


43 0 


Val 


Pro 


Leu 


His 


Pro 


Asn 


Thr 


Val 


Leu 


Arg 


Phe 


He 


Ser 


Gly 


Arg 


Ser 








130 










135 










140 










GGT 


TCT 


CTC 


ATC 


GAT 


GCC 


ATT 


GGC 


CTG 


CAC 


TGG 


GAT 


GTT 


TAC 


CCC 


ACT 


478 


Gly 


Ser 


Leu 


He 


Asp 


Ala 


He 


Gly 


Leu 


His 


Trp 


Asp 


Val 


Tyr 


Pro 


Thr 





145 150 155 

AGC TGC AGC AGA TGCTGAGCCT CCTCTCCTTG GCAGGGGCAC TGTGATGAGG 53 0 

Ser Cys Ser Arg 

160 

AGTAAGAACT CCTTATCACT AACCCCCATC 560 

(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

Val Ala Leu Leu Ala Leu Leu Cys Ala Ser Pro Ser Gly Asn Ala lie 
1 5 10 15 

Gin Ala Arg Ser Ser Ser Tyr Ser Gly Glu Tyr Gly Gly Gly Gly Gly 
20 25 30 

Lys Arg Phe Ser His Ser Gly Asn Gin Leu Asp Gly Pro lie Thr Ala 
3 5 4 0 4 5 
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Leu Arg Val Arg Val Asn Thr Tyr Tyr lie Val Gly Leu Gin Val Arg 
50 55 60 

Tyr Gly Lys Val Trp Ser Asp Tyr Val Gly Gly Arg Asn Gly Asp Leu 
65 70 75 80 

Glu Glu lie Phe Leu His Pro Gly Glu Ser Val lie Gin Val Ser Gly 

85 90 95 

Lys Tyr Lys Trp Tyr Leu Lys Lys Leu Val Phe Val Thr Asp Lys Gly 
100 105 110 

Arg Tyr Leu Ser Phe Gly Lys Asp Ser Gly Thr Ser Phe Asn Ala Val 
115 120 125 

Pro Leu His Pro Asn Thr Val Leu Arg Phe He Ser Gly Arg Ser Gly 
130 135 140 

Ser Leu He Asp Ala He Gly Leu His Trp Asp Val Tyr Pro Thr Ser 
145 150 155 160 

Cys Ser Arg 



(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



r 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

TAAACTTGCT GTTTTGTTCC TGTGTCTTGT CTTTGGTTGG TATTTCAGTA AGTTTTTGGT 6 0 

ATTCTCAAAT TTTATCTAAA TGGATAAACT ATTAACATAG AACATAAACC CCAATTCTCC 12 0 

ATTTCATTTT TCTCTTAGGC ATGAATCATA CAAAACTCAA TAT AG AG C AA TGTTTGTAAT 18 0 

GAATTGTTCT ATTAACAAAG AGGAGGTTCT AAGATATAAA G C C T C AG AG A ACAGGAAGAA 24 0 

AAGGCGGGTC CATAAGAAGA TGAGGTCTAA CCGGGAAGAT GCTGCTGAGA AGGCAGAGAC 3 00 

AGATGTGGAA GAAATCTATC ACCCAGTCAT GTGCACTGAA TGTTCCACTG AAGTGGCAGT 3 60 

TTACGACAAG GATGAAGTCT TTCATTTTTT CAATGTTTTA GCAAGCCATT CCTAAACAGC 42 0 

CCAACTGGCA TTTAATTACC CAATACTGTA TATAAGGCAA ATATGGACAG TTACTTTCCT 4 80 

CTTGCCTGTT CATATCCTTC AGTGACATTG AGGAAGCAGT GTTTCTCTTT TTAAAGGGGA 54 0 
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ATAGTTGTCA ACCTTCATTC ATCTCTTACA TCTTTCACCC TCTCCTTTTT TTTTTCTTTG 6 00 

ATTTTCCCCC TTATTGATGG GACTGATATT CATTCTGTTT TTGATGAACA TTTGGAAACT 6 60 

GTCGGGCTTT TTATTAAAGC TCTGTAGAAT TAAAATGTTC TGGAATTAT 7 09 
(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 125.. 367 

(ix) FEATURE: 

(A) NAME/ KEY : mat_peptide 

(B) LOCATION: 125.. 367 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CAGGAGGGAG AGCCTTCCCC AAGCAAACAA TCCAGAGCAG CTGTGCAAAC AACGGTGCAT 60 

AAATAAGGCC TCCTGGACCA TGAATGCGAG TCCGCTGAGC TGCGTACCGG AGCCCACGGT 12 0 

GGTC ATG GCT GCC AGA GCG CTC TGC ATG CTG GGG CTG GTC CTG GCC TTG 169 
Met Ala Ala Arg Ala Leu Cys Met Leu Gly Leu Val Leu Ala Leu 
15 10 15 

CTG TCC TCC AGC TCT GCT GAG GAG TAC GTG GGC CTG TCT GCA AAC CAG 217 
Leu Ser Ser Ser Ser Ala Glu Glu Tyr Val Gly Leu Ser Ala Asn Gin 
20 25 30 

TGT GCC GTG CCA GCC AAG GAC AGG GTG GAC TGC GGC TAC CCC CAT GTC 2 65 

Cys Ala Val Pro Ala Lys Asp Arg Val Asp Cys Gly Tyr Pro His Val 
35 40 45 

ACC CCC AAG GAG TGC AAC AAC CGG GGC TGC TGC TTT GAC TCC AGG ATC 313 
Thr Pro Lys Glu Cys Asn Asn Arg Gly Cys Cys Phe Asp Ser Arg lie 
50 55 60 

CCT GGA GTG CCT TGG TGT TTC AAG CCC CTG ACA GGG AAG CAG GAA TGC 3 61 

Pro Gly Val Pro Trp Cys Phe Lys Pro Leu Thr Gly Lys Gin Glu Cys 
65 70 75 

ACC TTC TGAGGCACCT CCAGCTGCCC CCCGGCCGGG GGATGCGAGG CTCGG AG C AC 417 
Thr Phe 
80 

CCTTGCCCGG CTGTGATTGC TGCCAGGCAC TGTTCATCTC AGCTTTTCTG TCCCTTTGCT 4 77 
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CCCGGAAGCG CTTCTGCTGA AAGTTCATAT CTGGAGCCTG ATGTTTAACG TAGTCCCATG 53 7 

CTCCACCCGA AAAAAAAAAA AAAAAAAAAA AAA 5 70 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Met Ala Ala Arg Ala Leu Cys Met Leu Gly Leu Val Leu Ala Leu Leu 
15 10 15 

Ser Ser Ser Ser Ala Glu Glu Tyr Val Gly Leu Ser Ala Asn Gin Cys 

20 25 30 

Ala Val Pro Ala Lys Asp Arg Val Asp Cys Gly Tyr . Pro His Val Thr 

35 40 45 

Pro Lys Glu Cys Asn Asn Arg Gly Cys Cys Phe Asp Ser Arg lie Pro 
50 55 60 

Gly Val Pro Trp Cys Phe Lys Pro Leu Thr Gly Lys Gin Glu Cys Thr 
65 70 75 80 

Phe 

(2) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 42.. 1010 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 42.. 1010 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 

AAGCTCTTCT CACAGGACCA GCCACTAGCG CAGCTCGAGC G ATG GCC TAT GTC 53 

Met Ala Tyr Val 
1 
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CCC GCA CCG GGC TAC CAG CCC ACC TAC AAC CCG ACG CTG CCT TAC TAC 101 
Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr Leu Pro Tyr Tyr 
5 10 15 20 

CAG CCC ATC CCG GGC GGG CTC AAC GTG GGA ATG TCT GTT TAC ATC CAA 14 9 

Gin Pro lie Pro Gly Gly Leu Asn Val Gly Met Ser Val Tyr lie Gin 
25 30 35 

GGA GTG GCC AGC GAG CAC ATG AAG CGG TTC TTC GTG AAC TTT GTG GTT 197 
Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val Asn Phe Val Val 
40 45 50 

GGG CAG GAT CCG GGC TCA GAC GTC GCC TTC CAC TTC AAT CCG CGG TTT 24 5 

Gly Gin Asp Pro Gly Ser Asp Val Ala Phe His Phe Asn Pro Arg Phe 
55 60 65 

GAC GGC TGG GAC AAG GTG GTC TTC AAC ACG TTG CAG GGC GGG AAG TGG 2 93 

Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gin Gly Gly Lys Trp 
70 75 80 

GGC AGC GAG GAG AGG AAG AGG AGC ATG CCC TTC AAA AAG GGT GCC GCC 341 
Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys Lys Gly Ala Ala 
85 90 95 100 

TTT GAG CTG GTC TTC ATA GTC CTG GCT GAG CAC TAC AAG GTG GTG GTA 3 89 

Phe Glu Leu Val Phe lie Val Leu Ala Glu His Tyr Lys Val Val Val 
105 110 115 

AAT GGA AAT CCC TTC TAT GAG TAC GGG CAC CGG CTT CCC CTA CAG ATG 43 7 

Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro Leu Gin Met 
120 125 130 

GTC ACC CAC CTG CAA GTG GAT GGG GAT CTG CAA CTT CAA TCA ATC AAC 4 85 

Val Thr His Leu Gin Val Asp Gly Asp Leu Gin Leu Gin Ser lie Asn 
135 140 145 

TTC ATC GGA GGC CAG CCC CTC CGG CCC CAG GGA CCC CCG ATG ATG CCA 53 3 

Phe lie Gly Gly Gin Pro Leu Arg Pro Gin Gly Pro Pro Met Met Pro 
150 155 160 

CCT TAC CCT GGT CCC GGA CAT TGC CAT CAA CAG CTG AAC AGC CTG CCC 581 
Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu Asn Ser Leu Pro 
165 170 175 180 

ACC ATG GAA GGA CCC CCA ACC TTC AAC CCG CCT GTG CCA TAT TTC GGG 62 9 

Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val Pro Tyr Phe Gly 
185 190 195 

AGG CTG CAA GGA GGG CTC ACA GCT CGA AGA ACC ATC ATC ATC AAG GGC 677 
Arg Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr lie lie lie Lys Gly 
200 205 210 

TAT QTG CCT CCC ACA GGC AAG AGC TTT GCT ATC AAC TTC AAG GTG GGC 72 5 

Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala lie Asn Phe Lys Val Gly 
215 220 225 
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TCC TCA GGG GAC ATA GCT CTG CAC ATT AAT CCC CGC ATG GGC AAC GGT 773 
Ser Ser Gly Asp lie Ala Leu His lie Asn Pro Arg Met Gly Asn Gly 
230 235. 240 

ACC GTG GTC CGG AAC AGC CTT CTG AAT GGC TCG TGG GGA TCC GAG GAG 821 
Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp Gly Ser Glu Glu 
245 250 255 260 

AAG AAG ATC ACC CAC AAC CCA TTT GGT CCC GGA CAG TTC TTT GAT CTG 869 
Lys Lys lie Thr His Asn Pro Phe Gly Pro Gly Gin Phe Phe Asp Leu 
265 270 275 

TCC ATT CGC TGT GGC TTG GAT CGC TTC AAG GTT TAC GCC AAT GGC CAG 917 
Ser lie Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala Asn Gly Gin 
280 285 290 

CAC CTC TTT GAC TTT GCC CAT CGC CTC TCG GCC TTC CAG AGG GTG GAC 965 
His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe Gin Arg Val Asp 
295 300 305 

ACA TTG GAA ATC CAG GGT GAT GTC ACC TTG TCC TAT GTC CAG ATC 1010 
Thr Leu Glu lie Gin Gly Asp Val Thr Leu Ser Tyr Val Gin lie 
310 315 320 

TAATCTATTC CTGGGGCCAT AACTCATGGG AAAACAGAAT TATCCCCTAG GACTCCTTTC 10 7 0 

TAAGCCCCTA ATAAAATGTC TGAGGGTGTC TCAAAAAAAA AAAAAAAAAA A 1121 

(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 23 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Tyr Val. Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr 
1 5 10 15 

Leu Pro Tyr Tyr Gin Pro lie Pro Gly Gly Leu Asn Val Gly Met Ser 
20 25 30 

Val Tyr lie Gin Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val 
35 40 45 

Asn Phe Val Val Gly Gin Asp Pro Gly Ser Asp Val Ala Phe His Phe 
50 55 60 

Asn Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gin 
65 70 75 80 

Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys 
85 90 95 
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Lys Gly Ala Ala Phe Glu Leu Val Phe lie Val Leu Ala Glu His Tyr 
100 105 110 

Lys Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu 
115 120 125 

Pro Leu Gin Met Val Thr His Leu Gin Val Asp Gly Asp Leu Gin Leu 
130 135 140 

Gin Ser lie Asn Phe lie Gly Gly Gin Pro Leu Arg Pro Gin Gly Pro 
145 150 155 160 

Pro Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu 
165 170 175 

Asn Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val 
180 185 190 

Pro Tyr Phe Gly Arg Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr lie 
195 200 205 

lie lie Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala lie Asn 
210 215 220 

Phe Lys Val Gly Ser Ser Gly Asp He Ala Leu His He Asn Pro Arg 
225 230 235 240 

Met Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp 
245 250 255 

Gly Ser Glu Glu Lys Lys He Thr His Asn Pro Phe Gly Pro Gly Gin 
- 260 265 270 

Phe Phe Asp Leu Ser He Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr 
275 280 285 

Ala Asn Gly Gin His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe 

290 295 300 

Gin Arg Val Asp Thr Leu Glu He Gin Gly Asp Val Thr Leu Ser Tyr 

305 310 315 320 

Val Gin He 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY : linear 

( i i ) MOLECULE TYPE : cDNA 



(ix) FEATURE : 
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(A) NAME/ KEY : CDS 

(B) LOCATION: 1 . . 603 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1 . . 603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 

GTT GAT ATT AAA ACC AGT GAA ACC AAA CAT GAC ACC TCT CTG AAA CCT 4 8 

Val Asp He Lys Thr Ser Glu Thr Lys His Asp Thr Ser Leu Lys Pro 
! 5 10 15 

ATT AGT GTC TCC TAC AAC CCA GCC ACA GCC AAA GAA ATT ATC AAT GTG 96 

He Ser Val Ser Tyr Asn Pro Ala Thr Ala Lys Glu He He Asn Val 

20 25 30 

GGG CAT TCC TTC CAT GTA AAT TTT GAG GAC AAC GAT AAC CGA TCA GTG 14 4 

Gly His Ser Phe His Val Asn Phe Glu Asp Asn Asp Asn Arg Ser Val 
35 40 45 

CTG AAA GGT GGT CCT TTC TCT GAC AGC TAC AGG CTC TTT CAG TTC CAT 192 

Leu Lys Gly Gly Pro Phe Ser Asp Ser Tyr Arg Leu Phe Gin Phe His 
50 55 60 

TTT CAC TGG GGC AGT ACA AAT GAG CAT GGT TCA GAA CAT ACA GTG GAT 240 

Phe His Trp Gly Ser Thr Asn Glu His Gly Ser Glu His Thr Val Asp 
65 70 75 80 

GGA GTC AAA TAT TCT GCC GAG CTT CAC GTG GCT CAC TGG AAT TCT GCA 28 8 

Gly Val Lys Tyr Ser Ala Glu Leu His Val Ala His Trp Asn Ser Ala 
85 90 95 

AAG TAC TCC AGC CTT GCT GAA GCT GCC TCA AAG GCT GAT GGT TTG GCA 336 

Lys Tyr Ser Ser Leu Ala Glu Ala Ala Ser Lys Ala Asp Gly Leu Ala 

100 105 HO 

GTT ATT GGT GTT TTG ATG AAG GTT GGT GAG GCC AAC CCA AAG CTG CAG 3 84 

Val He Gly Val Leu Met Lys Val Gly Glu Ala Asn Pro Lys Leu Gin 
115 120 125 

AAA GTA CTT GAT GCC CTC CAA GCA ATT AAA ACC AAG GGC AAA CGA GCC 43 2 

Lys Val Leu Asp Ala Leu Gin Ala He Lys Thr Lys Gly Lys Arg Ala 

130 135 140 

CCA TTC ACA AAT TTT GAC CCC TCT ACT CTC CTT CCT TCA TCC CTG GAT 

Pro Phe Thr Asn Phe Asp Pro Ser Thr Leu Leu Pro Ser Ser Leu Asp 
145 150 155 160 

TTC TGG ACC TAC CCT GGC TCT CTG ACT CAT CCT CCT CTT TAT GAG AGT 52 8 

Phe Trp Thr Tyr Pro Gly Ser Leu Thr His Pro Pro Leu Tyr Glu Ser 
165 170 175 

GTA ACT TGG ATC ATC TGT AAG GAG AGC ATC AGT GTC AGT TCA GAG CAG 576 

Val Thr Trp He He Cys Lys Glu Ser He Ser Val Ser Ser Glu Gin 

180 185 190 



480 
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TTG GCA CAA TTC CGG 
Leu Ala Gin Phe Arg 
195 



AGC CTT CTA TCA AT 
Ser Leu Leu Ser 
200 



605 



(2) INFORMATION FOR 



SEQ ID NO : 18 : 



(i) SEQUENCE 



CHARACTERISTICS : 



(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Val Asp lie Lys Thr Ser Glu Thr Lys His Asp Thr Ser Leu Lys Pro 
15 10 15 

lie Ser Val Ser Tyr Asn Pro Ala Thr Ala Lys Glu lie lie Asn Val 
20 25 30 

Gly His Ser Phe His Val Asn Phe Glu Asp Asn Asp Asn Arg Ser Val 
35 40 45 

Leu Lys Gly Gly Pro Phe Ser Asp Ser Tyr Arg Leu Phe Gin Phe His 
50 55 60 

Phe His Trp Gly Ser Thr Asn Glu His Gly Ser Glu His Thr Val Asp 
65 70 75 80 

Gly Val Lys Tyr Ser Ala Glu Leu His Val Ala His Trp Asn Ser Ala 
85 90 95 

Lys Tyr Ser Ser Leu Ala Glu Ala Ala Ser Lys Ala Asp Gly Leu Ala 
100 105 110 

Val lie Gly Val Leu Met Lys Val Gly Glu Ala Asn Pro Lys Leu Gin 
115 120 125 

Lys Val Leu Asp Ala Leu Gin Ala lie Lys Thr Lys Gly Lys Arg Ala 
130 135 140 

Pro Phe Thr Asn Phe Asp Pro Ser Thr Leu Leu Pro Ser Ser Leu Asp 
145 150 155 160 

Phe Trp Thr Tyr Pro Gly Ser Leu Thr His Pro Pro Leu Tyr Glu Ser 
165 170 175 

Val Thr Trp lie He Cys Lys Glu Ser He Ser Val Ser Ser Glu Gin 

180 185 190 

Leu Ala Gin Phe Arg Ser Leu Leu Ser 



195 



200 



(2) INFORMATION FOR SEQ ID NO : 19 : 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 559 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2 . .469 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 2 . .469 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

C GGC TCC GGG CGG GCG TGG CCA GTG ACT AGA AGG CGA GGC GCC GCG 4 6 

Gly Ser Gly Arg Ala Trp Pro Val Thr Arg Arg Arg Gly Ala Ala 
15 10 15 

GGA CCA TGG CGG CGG CGG CGG ACG AGC GGA GTC CAG AGG CGA GAA GAC 94 
Gly Pro Trp Arg Arg Arg Arg Thr Ser Gly Val Gin Arg Arg Glu Asp 
20 25 30 

GAG GAA GAG GAG GAG CAG TTG GTT CTG GTG GAA TTA TCA GGA ATT ATT 14 2 

Glu Glu Glu Glu Glu Gin Leu Val Leu Val Glu Leu Ser Gly He He 
35 40 45 

GAT TCA GAC TTC CTC TCA AAA TGT GAA AAT AAA TGC AAG GTT TTG GGC 190 
Asp Ser Asp Phe Leu Ser Lys Cys Glu Asn Lys Cys Lys Val Leu Gly 
50 55 60 

ATT GAC ACT GAG AGG CCC ATT CTG GCA ATG GAC AGC TGT GTC TTT GCT 23 8 

He Asp Thr Glu Arg Pro He Leu Ala Met Asp Ser Cys Val Phe Ala 
65 70 75 

GGG GAG TAT GAA GAC ACT CTA GGG ACC TGT GTT ATA TTT GAA GAA AAT 286 
Gly Glu Tyr Glu Asp Thr Leu Gly Thr Cys Val He Phe Glu Glu Asn 
80 85 90 95 

GTT GAA CAT GCT GAT ACA GAA GGC AAT AAT AAA ACA GTG CTA AAA TAT 334 
Val Glu His Ala Asp Thr Glu Gly Asn Asn Lys Thr Val Leu Lys Tyr 
100 105 110 

AAA TGC CAT ACA ATG AAG AAG CTC AGC ATG ACA AGA ACT CTC CTG ACA 3 82 

Lys Cys His Thr Met Lys Lys Leu Ser Met Thr Arg Thr Leu Leu Thr 
115 120 125 

GAG AAG AAG GAA GGA GAA GAA AAC ATA GGT GGG GTG GAA TGG CTG CAA 43 0 

Glu Lys Lys Glu Gly Glu Glu Asn He Gly Gly Val Glu Trp Leu Gin 
130 135 140 

ATA AGG ATA TGG TTT CTC CCT TTG ACC CAA CAG GTT TGT TAACTTTTCT 47 9 

lie Arg He Trp Phe Leu Pro Leu Thr Gin Gin Val Cys 
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145 150 155 

ACCATGAAAT TGAGGACGAG GAAGTGGTAG CTTTCAGCCC CGTTAAATCT TTGGATTTGG 53 9 

GAGGGGGTGG GGTTTCAATG 5 59 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Gly Ser Gly Arg Ala Trp Pro Val Thr Arg Arg Arg Gly Ala Ala Gly 
15 10 15 

Pro Trp Arg Arg Arg Arg Thr Ser Gly Val Gin Arg Arg Glu Asp Glu 
20 25 30 

Glu Glu Glu Glu Gin Leu Val Leu Val Glu Leu Ser Gly lie lie Asp 
35 40 45 

Ser Asp Phe Leu Ser Lys Cys Glu Asn Lys Cys Lys Val Leu Gly lie 
50 55 60 

Asp Thr Glu Arg Pro lie Leu Ala Met Asp Ser Cys Val Phe Ala Gly 
65 70 75 80 

Glu Tyr Glu Asp Thr Leu Gly Thr Cys Val lie Phe Glu Glu Asn Val 
85 90 95 . 

Glu His Ala Asp Thr Glu Gly Asn Asn Lys Thr Val Leu Lys Tyr Lys 
100 105 110 

Cys His Thr Met Lys Lys Leu Ser Met Thr Arg Thr Leu Leu Thr Glu 
115 120 125 

Lys Lys Glu Gly Glu Glu Asn lie Gly Gly Val Glu Trp Leu Gin lie 
130 135 140 

Arg lie Trp Phe Leu Pro Leu Thr Gin Gin Val Cys 
145 . 150 155 

(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 

GTGGCAGAAG AAAGATAGGT TGGAGACAAT TGATTGCTCG ATGATATAAA ATGTTAAGTA 60 

CCATGAATGN ATGCTGTTAG GCTGGAATGC GCCAAGATAA AAGGTGGGGC ATGGCATCAA 12 0 

AAGGTAGGTC AACATATTAA ATAATTCCAT GTATTGAAAT ATCCAGAAAA TATATAGACA 180 

GATCTATAGA GATAGAAACT GGTCTGCCCA GGACTAGGGG TTGTCTAAGG ATAAGGAGCT 24 0 

TCTTTTTTGG ATGGTGAAAT AACCTAAAAT ATATTGTGCC ATTGTTTGCA CAACTTTGTG 3 00 

GAATATATTA AAAACCGGTT AATTGTACTC ACTAAAATGT CCTCCTTCTT AAATTTAAGC 3 60 

TGTTTNCTGG ACAAGAAAAA GGGAAAGNNA CCAAGGGGNA AAAAATTTT 409 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCCCTGGGCT TTGGGGGGGT CCCAAACATG GTATGCAGAA ATGTGATGGT TACAGGTCAG 6 0 

TACAACCTCA GTCCTTAGAA CCCCTCCACA CTTCAGCTCT GCACCCACTT TCCTGTCATT 12 0 

TATTTATATA GGACTGTAGT TTTTTTTAGT TCGAGAGCCT TTCGAAGCTT AATTTATATT 18 0 

CTTTCTTTGT ACCTTTTTTC TAAAATTACC AAAGATATTA CACAAAGGTA AATTAATGTT 24 0 

CTCTGTTTTA TGCTTTATCT GATGGAGGCA AATATCCTCT TATTGTTGAT CAAAGGGGGC 3 00 

AAAAGAATTT AGAGGCAAAT GAACAAGCGA TAGGCTATTG CAACCTGAGA AAGAGAACTG 3 60 

NTCCTTCCAT CGTAAATTTA GNAGNCCAAG TAGGTAATGG GAACCAAAGT TGTTACTTTT 42 0 

TTCTAGTAGT TATTTTTCCC TTTTTNNTTT TTGTGGTACC TCTTACAGNG NCCCAAAACT 48 0 

CCATTCTCTT TAAAGGGGTT TTTATGGGGG GCTTACTGCA GGTTAAAAAT TGGGGNC C AC 54 0 

CATTTTTAAA GGGGGGCTAC CAGAAGGGAG GGGGGTCCCC NTTNCNAAAA AAAAAAATTG 60 0 

(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS': single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
ATGCTTCCGG CTCGTATG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GGGTTTTCCC AGTCACGAC 
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